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Abstract — Low-density parity-check (LDPC) codes have been 
used for communication over a two-user Gaussian broadcast 
channel. It has been shown in the literature that the optimal 
decoding of such system requires joint decoding of both user 
messages at each user. Also, a joint code design procedure should 
be performed. We propose a method which uses a novel labeling 
strategy and is based on the idea behind the bit-interleaved coded 
modulation. This method does not require joint decoding and/or 
joint code optimization. Thus, it reduces the overall complexity 
of near-capacity coding in broadcast channels. For different rate 
pairs on the boundary of the capacity region, pairs of LDPC 
codes are designed to demonstrate the success of this technique. 

I. Introduction 

The problem of simultaneous communication of a single 
source to multiple receivers, which is known as the broadcast 
channel, was first introduced by Cover in [1]. So far, the 
capacity region of certain classes of broadcast channels are 
known; however, the capacity region of a broadcast channel 
in general is still unknown. 

Based on the achievable rate region given in [2], Berlin 
and Tuninetti in [3] studied the code design problem for a 
two-user fading Gaussian broadcast channel. They used low- 
density parity-check (LDPC) codes as the coding framework 
and studied the optimal decoding procedure. In fact, they 
showed that using superposition encoding and joint decoding, 
close-to-capacity LDPC codes can be found at low signal-to- 
noise ratios (SNRs). In their optimal scheme, since the user 
messages are superimposed, a joint factor graph is formed. 
Moreover, to obtain the maximum a posteriori (MAP) estima- 
tion of bits, the message updating rules impose mapper nodes. 
These mapper nodes connect the Tanner graphs associated 
with each user's code to act as interference cancelers. These 
mapper nodes not only increase the decoding complexity, they 
also require both users to have the codebook of each other 
and perform joint decoding. In addition, for near capacity 
performance, the codes should be jointly optimized. Most 
LDPC code design techniques are based on a search in the 
space of code parameters. The complexity of such search- 
based code design techniques increases significantly with the 
number of design parameters. A joint code optimization means 
that the number of design parameters is almost doubled. 

In this paper, motivated by the bit-interleaved coded mod- 
ulation (BICM) scheme [4], we propose a suboptimal scheme 
which employs a novel labeling method in order to remove the 



mapper nodes. Therefore, each user can use its own LDPC 
code. In other words, without the mapper nodes, there is 
no need for joint decoding; the users do not need to have 
the codes of each other and joint optimization of codes is 
not necessary. Our numerical results show that the proposed 
method performs close to the optimal solution of [3] (albeit 
with a lower decoding complexity.) 

Since LDPC decoders usually use the log-likelihood ratio 
(LLR) values, we study the properties of LLRs based on the 
proposed method by a discussion on computing the probability 
density function (pdf) of LLRs. 

In Section [II] we briefly review the main results known for 
broadcast channels. We discuss using LDPC codes for a two- 
user Gaussian broadcast channel in Section [III] Our method is 
proposed in Section [IV] and LDPC codes based on our method 
are designed in Section [V] Section [VI] concludes the paper. 

II. Background 

A two-user broadcast channel consists of an input alphabet 
X, two output alphabets y and Z, and a set of channel 
transition probabilities p(y, z\x) where (x,y,z) 6 X x y x Z. 
A (2 nRy , 2 nR ~ , n) broadcast code consists of two equiprobable 
message sets W y = {1, 2, . . . , M y } and W z = {1, 2, . . . , M z } 
where M y = 2 nRy and M z = 2 nRz , a codebook which has 
My x M z codewords of length n and symbols from the input 
alphabet X, and two decoders which assign two message 
indices w y (y n ) S W y and w z (z n ) G W z to each received 
observation pair (y n ,z n ) [1]. 

The goal is to send private messages to both receivers 
with a vanishing probability of error. The users can have 
a common message; however, in this work, we are only 
interested in the private messages. Two private messages are 
drawn independently from two message sets W y and W z , 
and then the corresponding codeword is transmitted over the 
broadcast channel. 

A pair of rates (R y ,R z ) is said to be achievable if there 
exists a (2 nRy , 2 nRz , n) broadcast code with vanishing average 
probabilities of error at both of the receivers, as n — > oo 
[1]. The capacity region of a broadcast channel is the convex 
closure of all the achievable rates and it depends only on the 
marginal densities, i.e., p(y\x) and p(z\x) [1]. 

The single-letter characterization of the capacity region 
of a general broadcast channel is still unknown. In special 
cases, however, the capacity region is known. Here, we 



confine our attention to degraded broadcast channels. If the 
broadcast channel transition probability can be factorized 
as p(y,z\x) — p(z\y)p(y\x), then the broadcast channel is 
physically degraded which implies that X, Y, and Z forms a 
Markov chain, i.e., X — > Y — >■ Z [5]. In other words, user 
Z receives a more degraded version of X than user Y. We 
denote the convex hull by CH. Bergmans [6] proved that the 
capacity region of a degraded broadcast channel X — > Y — ► Z 
is the set of rates (R y ,R z ) such that 



CH 

p(v)p(x\v) 



Ry, R Z 



> 



Rz <I{V;Z) 
R y <I(X;Y\V) 



(1) 



where V is an auxiliary random variable whose support set 
V satisfies |V| < min{|«¥|, \y\, \Z\}. The idea is that the 
auxiliary random variable V serves as a cloud center (cloud 
of codewords) distinguishable by both receivers [6]. There 
are in total M z clouds available and each cloud contains M y 
codewords. The "weaker" user, i.e., Z, can only see the clouds 
while the user Y can also see codewords within a cloud. In 
fact, user Y first strips off the message of user Z (decodes 
the cloud) and then can see the individual codewords within 
a cloud [6], This method is called superposition coding. 

A. Gaussian Broadcast Channels 

The focus of this work is on the Gaussian broadcast 
channels which are defined as [3] 



Y = AX + Ny 
Z = BX + N Z 



(2) 



where the additive white Gaussian noises are zero mean and 
have variance No, i.e., N y ,N z ~ Af(0, No), independent from 
the input X which is power constrained by E(|X| 2 ) < P. 
Also, A and B are two ergodic memoryless processes, known 
at the receivers. In general, the broadcast channel given in 
d2J is neither degraded nor more capable [2]. However, if the 
fading processes are constant (unfaded Gaussian) and |A| > 
\B\ then (0 will be degraded and the capacity region according 
to (Q} is given by [3] 



(J <Ry,R Z >0 
ag[0,l] I 



R y < C(a|A| 2 7 ) 

R z < C(\B\ 2 -y) - C(a|B| 2 7 ) 



where C(x) — \ log 2 (l + x) and 7 = The boundary of 
this region is achieved by 



X = 



(3) 



where E(\X\ 2 ) = P and a = 1 - a for a G [0,1]. Also, 
a represents the fraction of power allocated for user Y, and 
(U, V) ~ N (0,1.2) is a P a ir of independent normal random 
variables [7]. 

Since the Gaussian input given in (O cannot be used in 
practice, Berlin and Tuninetti in [3] consider the performance 
achievable by a binary linear codebook instead of the Gaussian 



codebook. This means that (U, V) in ^ are now drawn uni- 
formly from {— 1, +1} x {— 1, +1}. In this case, the capacity 
region is given by [3] 




Ry j Rz 



> 



R y < J(a\A\ 2 1 ) 
R z <J(\B\ 2 1 )-J(a\B\ 2 1 ) 



where J(t) = 1 - E M log 2 (l + e~ M ) and M ~ Af(t/2, t). 

We use LDPC codes in this work as the binary code to 
communicate over the Gaussian broadcast channel. 

B. LDPC Codes 

In this paper, following the notation of [8], an ensemble 
of LDPC codes is defined by a pair of distributions (A, p) 
in the polynomial form, i.e., X(x) = J2i>2 ^j^' -1 ' an d 
p( x ) = 53i>2 Pi xl _1, Throughout this work, we will use 
appropriate subscripts to distinguish the codes of different 
users. Transmission of LDPC codes takes place on a memory- 
less binary-input symmetric-output (BISO) channel. Under the 
sum-product decoding and the all-one codeword transmission 
assumption [8], a BISO channel is completely characterized 
by the pdf of its LLR messages, denoted by a c h(x) where for 
all x, we have a c h(— x) = a c h(x)e~ x . 

By code design for a fixed channel, we mean optimizing 
the polynomials A(a;) and p(x) in order to obtain an LDPC 
code with the highest code rate which converges to zero error- 
rate on the given channel. Similar to [8], we fix p(x) so 
that the optimization problem can be formulated as a linear 
programming. 

III. LDPC Coding for Gaussian Broadcast 
Channels 

For LDPC coding on the broadcast channel, as suggested 
in [3], one can pick two LDPC codes from the ensembles 
(\y,p y ) and (\ z ,p z ). Denoting codewords of length n by 
boldface letters, the superimposed transmitted vector x can 
be written as 



where x y ,x 



x = VaPxy + VaPx z (4) 

G {— are the binary codewords of the 
users Y and Z, respectively. Also, by transmission of x, we 
observe two vectors y and z. 

The factor graph associated with the MAP estimate of Xi tV , 
the ith bit of the binary vector x y , is given by [3] where the 
function node connecting the two Tanner graphs is called the 
mapper node. The mapper node increases the complexity in 
the following ways: 

• The mapper node forces both users to have the code of 
each other in order to jointly decode the codewords. 

• The decoding complexity increases considerably. This is 
because at each iteration of the message passing decoder, 
the messages from one Tanner graph should be passed to 
the other graph to enhance the reliability of decisions. 

• The code design stage becomes more cumbersome than 
in the single-user case as the codes must be optimized 
jointly. 

'We use the conventional mapping 1— » 1 and 1 h -1. 



IV. A Disjoint LDPC Coding Scheme 



to be 



In this section, we propose a method to use LDPC codes 
over a two-user Gaussian broadcast channel based on a novel 
adaptive labeling method and interleaving the bits. The main 
goal is to allow for disjoint decoding of messages and the 
idea is motivated by BICM. Thus, a brief discussion on BICM 
which leads to our main idea is presented. 

A. Bit-Interleaved Coded Modulation 

BICM is a bandwidth-efficient coding method [4] which is 
based on the concatenation of a binary code, a bit-interleaver 
and a high-order modulation [9]. The coded bits in BICM 
are interleaved and then every d — log 2 D bits are grouped 
together and sent over the channel using a D-ary constellation. 
At the receiver, after computing the LLR values of the coded 
bits and de-interleaving, a binary decoder is used as if the LLR 
values were the observations at a binary phase shift keying 
channel output [9]. 

Now, consider two labeling methods: Gray labeling and 
binary set partitioning [10] (Ungerboeck) labeling. In Gray 
labeling, the label of each point of the constellation differs 
from its neighbors only in one bit. Ungerboeck labeling 
partitions the constellation such that each bit has a different 
level of protection and the received symbols have to be 
decoded sequentially. It has been shown [4] that if we use 
Gray labeling for the D-ary constellation, then the capacity 
of BICM is extremely close to the capacity of the optimal 
receiver. Using binary set partitioning (Ungerboek labeling) 
and BICM, the capacity is far from the optimal decoding since 
the BICM receiver considers that the bits are independent 
while the optimal receiver exploits the dependency between 
the successive bits. 

An important result of [4] is that using a bit interleaver 
and Gray labeling, a binary decoder can be used to get a 
performance which is almost the same as the optimal receiver. 
Another way of understanding this result is that the depen- 
dency among the label bits in a Gray-labeled constellation is 
minor, allowing the decoder not to exploit the dependency and 
still provide near-optimal performance. 

B. The Proposed Method 

Let us have a look at the superimposed codeword given 
in The transmitted symbol X is selected from the set 
X = {±VaP ± \/aP} that can be viewed as a mapping 
which maps two independent bits to a point in a 4-PAM-like 
constellation shown in Table J] This mapping uses a binary 
labeling method and it should be emphasized that depending 
on the value of a, we can have different configurations. In 
the first column of Table H] the most significant bit position 
represents X z and the other bit represents X y . 

Now, consider two sequences of LDPC coded bits, each of 
which is intended for one of the users. We utilize the fact that 
LDPC codes are self-interleaved and apply Gray labeling for 
the 4-PAM-like constellation. Table U shows bit configurations 
for this scheme. When a > 5, the transmitted codeword has 



X = 



(5) 



in order to have Gray labeling. Since X y and X z are indepen- 
dent, zero-mean and unit-variance binary random variables, we 
obtain E(\X\ 2 ) = aPE{\X y \ 2 ) + aPE(\X z \ 2 )E{\X v \ 2 ) = P 
which shows that the power constraint is satisfied. 

To maintain Gray labeling for a < ~, as it is shown 
in Table U the position of bits has to be interchanged. The 
corresponding transmitted codeword is 



X 



'aPX y X z 



(6) 



which satisfies the power constraint as E(|X| 2 ) = 
aPE(\X y \ 2 )E(\X z \ 2 ) + aPE(\X z \ 2 ) = P. 

Remark 1: It is noteworthy that the two parts of (0 and 
(|6]l are not independent since they share a factor in common. 
Therefore, our method does not exactly match superposition 
coding. Also, in superposition coding, the message intended 
for the weaker user identifies the cloud centers, no matter 
how much power we allocate to the weaker user. From ©, 
i.e., for a > |, the cloud center is not identified by the 
message of user Z, i.e., X y X z . Therefore, the region obtained 
based on the proposed method may not be convex. This is 
because we adaptively force the labeling to be Gray, leading to 
a mismatch between the proposed method and superposition 
coding. However, we will see in Section [V] that the region 
based on our method covers most of the region given in [3], 

▲ 

As mentioned, by using Gray labeling we have reduced 
the dependency among the label bits. Interleaving removes 
the dependency altogether to validate our decoding approach. 
Moreover, interleaving does not incur a significant loss since 
the dependency among the bits was indeed minor due to 
the labeling scheme. An optimal decoder should still use the 
existing dependency, but the performance gain will be minor. 
In binary labeling, the dependency must be used (that is what 
the mapper node does) because it is too strong. 

To analyze the proposed method, let us determine the 
capacity region using our method. For a > h, we have 



R Z <I{V;Z) 



p(z\x z 



-dz 



= i-i/ pW x, = + i)io g2 ( 1 + ^f^llu 2 - 



and 



Ry<I(X;Y\V) 



H(Y\X 
H(Y\X 
1 
2 



H(Y\X,X Z ) 
H(AX + N y \X, X z ) 

[H(Y\X Z = +1) + H(Y\X Z = -I)] 



log 2 (2^e7Vo) 



TABLE I: Comparison of binary and Gray labeling methods where the symbol P is removed for simplicity. Note that for 
binary labeling, we always have X = \l aPX y + %/ aPX z . 



Binary labeling 


Gray labeling 


a S [0, 1] 


X Z Xy 

\ / 

11 01 10 00 


X = V^PXy + \/&PX Z X y Xz Xy 

\ / 

01 11 10 00 


a > \ 


1 . . ► 

— \fa — \J~6l — \fa yfa — \J~6l ^fu + \fEt 


— . . — 1 — . . - 

— \fa~\fdt \fEi — y/a \fot — y/& \fa. + y/~8t 


X Z Xy 

\ J 

11 10 01 00 


X = V^PX y X Z + VtiPX Z Xy X Z 

\ / 

01 11 10 00 




1 

— \fot — i/a — ~ \f®- y/a + y/K 


— . . — 1 — . . - 



where p{z\x z = +1) is a mixture of two Gaussian pdfs. In a 
similar manner, we can derive formulas for a < \. Finally, 
the region based on the proposed method is given by ([T). 

C. LLR Computation and Its Properties 

Usually LDPC decoders use LLR values. In this section 
we study some of the properties of LLRs for our proposed 
solution. In particular, we see that the LLR pdf does not satisfy 
the symmetry required for the all-one codeword assumption. 
We first describe how to obtain the LLRs for each of the users. 
For user Y and a > i, according to Table|I] the LLR message 
received from the channel p(y\x y ) is 



lOg; 

log 2 
log 2 



p(y\X y = +1) 
2 p(y\X y = -1) 

Ex a v{xz)v{y\x y = +l,x z ) 



T, Xz p(xz)p(y\X y = -l,x z ) 
n A {+i,+i) + n A (+i,-i) 



n A (-i,-i) 



-i,+D 



i 



exp l -Kirriv - a ^P(pV^ + Qy^)Y 



where 

and p,q <E {±1}. For a < |, we obtain 

p{y\X y = +1) 



m „ 



log 2 
log 2 



p(y\X y = -1) 

n A (+i,+i) + n A (- 



i.-i) 

n A (-i,+i) + n A (+i,-i)' 

Similarly, for user Z, we get 

O b (+1,+1) + O b (-1,-1) 



and 



m 2 



log 2 



loe 



n B (+i,-i) 
n B (+i,+i) 



-fi B (-i,+i) 
•n B (-i,+i) 



n B (-i,-i) + n B (+i,-i) 



for a > i and a < |, respectively. 

Lemma 1: The pdfs p(y\x y ) and p(z\x z ) are not symmetric. 
Proof: We have 



p(y\x y ) 



^p{xz)p{y\xy,x z ) 



= -j[p{y\x 



X z 



+l)+p(y\x y ,X z 



-I)]- 

n A (-i,-i) 



\, we get p{y\X y = -1) = 
1)] = p{~ v\X y = +1), but this does not hold 
i since p(y\X y = -1) - i[fi A (-l,+l) + 
1). This result can be extended 



For a > 

^(-1,4 
for a < 

n A (+l,-l)] ^ P (~y\X. 

to p(z\x z ). ■ 

Let us denote the pdf of m y by ach^m). Since the channel 
p(y\x y ) is not symmetric, the all-one codeword assumption for 
the purpose of density evolution [11] is not valid. However, 
according to [12], a symmetrized LLR pdf can be obtained 
using 



a c h, a (™) = ^[sL ch , y (m\Xy = +1) 



ach, v (-m\X y = -1) 



Note that if the channel was symmetric, the above equation 
would lead to ^(m) = a C h }y (m\X y = +1), resulting in the 
all-one codeword assumption. Similarly, we denote the LLR 
pdf of user Z by a c h, z (m). 

V. Simulation Results and Code Design 

In this section, we compare the regions resulting from a 
Gaussian input, binary labeling method from [3] and our 
proposed method. Then, we design LDPC codes based on our 
labeling method. 

Consider a two-user Gaussian broadcast channel given in 
© where \A\ 2 j = 5.059 dB, and \B\ 2 ~f = 3.871 dB. We 
choose these values to have a fair comparison with the region 
given in [3], In Fig.Q] we compare the capacity region when 
the optimal Gaussian input is used, the optimal region given 



SNR = 5.059 dB.SNR = 3.871 dB 

y z 




0.2 0.4 0.6 0.8 1 1.2 1.4 

R y [bpcu] 



Fig. 1: Comparison of the capacity region of a two-user 
Gaussian broadcast channel with different inputs. The cross 
points show the achieved rates by the proposed method given 
in Table El and Table El] 



TABLE II: Optimized degree distributions for user Y. 



a 


0.1 


0.2 


0.3 


0.8 


A2 , A3 


0.276,0.215 


0.263, 0.233 


0.190, 0.201 


0.162, 0.194 


A 5 , A 6 




0.000,0.009 


0.007, 0.029 


0.007,0.035 


A7, As 


0.012, 0.210 


0.144, 0.071 


0.055, 0.045 


0.065,0.036 


A9 j A10 


0.012, 0.020 


0.014,0.007 


0.042, 0.044 


0.030,0.033 


An , A12 




0.005,0.029 


0.036, 0.021 


0.037,0.031 


A13 j A14 






0.012, 0.007 


0.020, 0.012 


A15 , Ai6 






0.005, 0.036 


0.007,0.033 


A19 




0.006 






A20 j A21 




0.007,0.009 






A22 , A23 




0.011, 0.014 






A24, A25 




0.018, 0.022 






A26 , A27 




0.023, 0.022 






A28 J A29 




0.018, 0.014 






A30, A31 




0.011, 0.008 






A32 J A33 




0.006,0.036 






A48 J A49 






0.000, 0.009 


0.025,0.010 


A50 


0.249 




0.261 


0.263 


P5, P6 


1, 


0.173, 0.827 






PS, P9 






0.471,0.529 




P14, P15 








0.483,0.517 


Rate 


0.187 


0.320 


0.426 


0.637 



in [3], and the region based on our method in Section [IV] 
It can be seen that most of the optimal region is covered 
by our proposed method. As we discussed in Remark 1, 
the region based on our method is not convex. Since the 
proposed method does not require joint decoding, for each user 
a separate LDPC code can be optimized using the conventional 
techniques in the literature. In our numerical optimizations, we 
use the sum-product discrete density evolution established by 
Chung in [13], where the maximum LLR is set to 25 and a 9- 
bit quantizer is used. We allow codes with maximum variable 
node degree of 50 converging in at most 800 iterations to a 
target bit error rate of 10 -6 . It is noteworthy that the LLR pdf 
for each user is obtainable using the discussion in Section [TV] 



TABLE EH: Optimized degree distributions for user Z. 



a 


0.1 


0.2 


0.3 


0.8 


A2 j A3 


n 1/18 n 1 Q7 
u. i^o, u. iy 1 


u. ioy , u.zuu 


U.Zoo, U.ZO I 


n 9Q9 n 9^^ 
u . zyz , u. zoo 


A5 , Ae 


n nn7 n n97 

u . UU ' j U . UZ 1 


n nri7 n no a 

U.UU 1 , U.UZ4: 


n nnn n m i 
u. uuu , u . u 1 1 


n nnn n nno 
u.uuu, u.uuy 


\ _ \ r , 

^7 ; ^8 


n riR7 n n/17 


n n^7 n n^n 
U.UO 1 , u.uou 


n 1 os n no^ 
u. iyo, u.uzo 


n nofi n 1 m 
u.uyo, u. lui 


Ag ■ ^10 


n n n - 39 


U.U4o, U.U^U 


n nns n nn^ 
u.uuo, u.uuo 


n ni s n nno 
u.uio, u.uuy 


An j A 12 


n n^n n nofi 


n n n9^ 

U.UOO, U.UZo 




n nnfi n nn^ 
u.uuu, u.uuo 


A13 , Al4 


n mo n H1Q 
u.uiy, u.uio 


n ni a n nno 
u.ui4t, u.uuy 






A15 , Aie 


n nno n nrifi 
u.uuy, u.uuo 


U.UUD, U.UOO 


U.U4K) , U.UU 1 




A17, Alg 


n nn^ n n-10 




n nn7 n nno 
u.uu 1 , u.uuy 




Aig , A20 






n n 1 a n n 9 t 


n nfin n nns 

u . UDU , U. UUO 


A21 ) A22 






u.uocs, u.uiy 


n nn^ n nnr; 

U.UUO, U.UUO 


A23 ) A24 






n m7 n n99 
U.UO I , u.uzz 


n nnfi n nn7 

U.UUU, U.UU 1 


^25 j ^26 






0.013, 0.008 


0.008, 0.009 


A27 1 ^28 






0.006, 0.000 


0.011, 0.012 


A29 , A30 








0.012, 0.013 


A3I , A32 








0.012, 0.012 


A33 , A34 








0.011, 0.010 


A35. A36 








0.008, 0.007 


A37. A38 








0.006, 0.006 


A47, A48 


0.005, 0.008 


0.018,0.006 






A49; A50 


0.015, 0.255 


0.011, 0.252 






P4: P5 








0, 1 


P7> PS 






0.273, 0.727 




Pl2 




1 






Pl6, P17 


0.485,0.515 








Rate 


0.672 


0.572 


0.473 


0.244 



In Fig.Q] the achievable rates are shown by the cross points. 
Also, due to the limit of space, some of the optimized degree 
distributions for user Y and Z are reported in Table [II] and 
Table ITTA respectively. 

VI. Conclusion 

In this paper, a low complexity method for communicating 
over a two-user Gaussian broadcast channel based on LDPC 
codes was presented. Compared to the previous works, we 
showed that in our method, each user can use a single LDPC 
code and the need for joint decoding at the receivers is 
eliminated. 
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